Scour
🎮 Reinforcement Learning
Q-Learning, Policy Gradient, Reward Systems, Game AI, Robotics
Scoured 4856 posts in 136.3 ms
Control Reinforcement Learning: Token-Level Mechanistic Analysis via Learned SAE Feature Steering
arxiv.org · 14h
⚙ Context engineering
Playing 20 Question Game with Policy-Based Reinforcement Learning
arxiv.org · 2d
🧠 AI
Show HN: Fighting the War Against Expensive Reinforcement Learning
cadenza-landing-qtu7gbjwb-akshparekh123-3457s-projects.vercel.app · 12h · Discuss: Hacker News
⚙ Context engineering
Recursive self-improvement from AI models
marginalrevolution.com · 2d · Discuss: Hacker News
🧠 AI
Your AI Strategy Has a Human-Shaped Hole
superiortech.io · 5h · Discuss: Hacker News
🧠 AI
Why AI Breaks Down Without Real-Time Data in Defense Operations
singlestore.com · 4h
🧠 AI
ashworks1706/rlhf-from-scratch: A theoretical and practical deep dive into Reinforcement Learning with Human Feedback and its applications in Large Language Models from scratch.
github.com · 2d · Discuss: Hacker News
⚙ Context engineering
Show HN: A minimal online decision maker
decisionmaker.online · 1d · Discuss: Hacker News
👆 human-computer interaction
Part 2 - AI Chat Evaluation of the Formal Language in He Xin's PEPC System
news.ycombinator.com · 1d · Discuss: Hacker News
🤝 Multi-Agent Systems
Digitizing the "Shokunin": How we encoded a Master's hammer strike into AI
yusukekaizen.substack.com · 12h · Discuss: Substack
🧠 AI
Task-Completion Time Horizons of Frontier AI Models
metr.org · 1d · Discuss: Hacker News
🧠 AI
Architectural and Mathematical Foundations of Machine Learning: A Rigorous Synthesis of Theory, Geometry, and Implementation
chizkidd.github.io · 1d · Discuss: Hacker News
📍 embeddings
Cyber Model Arena
wiz.io · 3h · Discuss: Hacker News
🧠 AI
Schedules of Reinforcement in Psychology (Examples)
simplypsychology.org · 2d · Discuss: Hacker News
⚙ Context engineering
Robots That Can See Around Corners Using Radio Signals and AI
seas.upenn.edu · 1d · Discuss: Hacker News
🧠 AI
3D Tissue Braiding – a new, simpler way to build robotics
allonic.co · 2d · Discuss: Hacker News
⚙ Context engineering
Outcome Engineering
o16g.com · 1d · Discuss: Hacker News
⚙ Context engineering
Self-Referential Quantum Barriers for AGI Containment
redact-app.com · 1d · Discuss: Hacker News
⚙ Context engineering
☞ Maxis Software Toys
arbesman.substack.com · 1d · Discuss: Substack
🧠 AI
New ARIA research funding programme: nearly £50M to secure AI agents in the wild
aria.org.uk · 2d · Discuss: Hacker News
🧠 AI